Executive summary

This report provides an information about patients from Wuhan infected with COVID-19 between 2020-01-10 19:45:00 and the 2020-02-18. The data contains medical examinations of patients aged from 18 up to 95. Thanks to visualizations we are able to distinguish how the death/recovery ratio look like. In fact it is easy to observe that most of dying people occurs between 60-75, and many people recover in age 20-50. This might be caused by various factors which are included in graphs. Those factors are most likely correlated with old age, and are in most cases far from norm values.

Infected people age distribution

At the figure presented below we can notice that in both male and female cases, the most frequent patients that died due to COVID-19 were in advanced age - with an average above 60 years. For the recovered patients in both male and female cases the count is more or less uniformly distributed in all age range from 20 to 70.

Days to death/recovery

At the figure presented below we can see that the mortality rate is highly correlated with the number of days elapsed till the patient was diagnosed in the hospital. The critical point at which it highly drops is around 15 days of being infected - after this time, the recovery chance highly rises. We can also see that the average time it takes for the patient to die is below 10 days while the recovery time is around 15 days.

Interesting observations

The figure represents correlation of High sensitivity C-reactive protein concentration to the outcome of infected person. These are one of the so-called acute phase proteins appearing in the blood as a consequence of inflammation. In a healthy person, the concentration is not high - no more than 5 mg/l and in fact it can be seen that most patients recovered have their concentration within limits while for the people that died, the concentration were much higher (around 18 times higher than normal level).

The second one - eGFR called glomerular filtration is a measurement of the amount of blood that gets filtered by the kidneys. Normally the result should be greater than or equal to 90 ml/min/1.73m2. For the people that died (both male and female cases) it can be seen that the level on average has dropped below the norm. For the “recovered” cases there is a big difference between gender - almost all female cases have value of eGFR witihin the norm, while the values for males vary greatly. Half of the cases are below norm and the other half is witihin it. It can suggest that based on the aquired data, the eGFR metric is much more accurate for female than male.

Possible causes of death

The next set of visualizations represent the outcome of infected patients based on four metrics and their age. Dotted lines represent the range of normal values for given metric. We can observe that most patients with abnormal values are in advanced age - above 60 age. The concentration for all metric, for deceased patients, falls on average on 70 years. The effect of abnormal values on the outcome of a patient is particularly visible for Neutrophils and Lymphocyte metrics, where almost all patients with abnormal values are deceased and those within the norms are recovered.

Importance of Prothrombin

Prothrombin is one of most important protein made in our body. It is known of crucial involvement in blood clotting (coagulation factor). In case of any injury Prothrombin works with other clotting factors to stop the bleeding. When our coagulation factor drops to low, it might cause a serious blood lose. But too high level may also be very harmful causing arteries or veins.

Prothrombin activity

Next useful metric is “Prothrombin activity”. It is a protein factor responsible for the formation of thrombin; The percentage of the norm is in the range of 70-130%. Two groups can be observed - first with almost all values within the correct range (with some outliers) presenting recovered patients and the second one containing data from deceased patients (concentration of these values is around 60%). As with previous metrics, main group of abnormal values for prothrombin activity are people in advanced age.

Prothrombin efficiency

Below there is also a plot showing the density of amount of people recovered and deceased based on the prothrombin activity time. It is quite visible that most cases recovered have the activity of around 13.5 seconds while the activity time rises, the amount of recovered people drastically falls and mortality rises.

International standard ratio (INR)

The international normalized ratio (INR) is a standardized number, it is assumed that the norm is between 0.8-1.2. But the results of this test might be very dependent of the age, gender and heath history of the patient. In fact many factors may cause a deviation from the norm. From the data we are able o observe that patients who recovered from Covid-19 are very close to norms, however those who died oscillate between 1 to 5.

Interesting findings

It might be interesting that the PATIENT ID is very highly correlated to death, in fact all patients below id 200 recovered, while above all died.

The graph below represents found phenomena. :)

meme :)